Copulas in three dimensions with prescribed 

correlations 

Luc Devroye and Gerard Letac 
April 20, 2010 

Abstract 

Given an arbitrary three-dimensional correlation matrix, we prove that 
there exists a three-dimensional joint distribution for the random variable 
(X, Y, Z) such that X,Y and Z are identically distributed with beta dis- 
tribution /3k,k{dx) on (0,1) if k > 1/2. This implies that any correlation 
structure can be attained for three-dimensional copulas. 

1 Introduction 

The simulation community is quite interested in the computer generation of 
identically distributed random vectors (Xi, . . . ,X n ) with prescribed marginal 
distribution (/i) and correlation matrix (R). We call a fixed distribution /i on 
the Borel sets of R n-universal if for every possible correlation matrix, there 
exists a joint distribution for (Xi, . . . , X n ) that achieves it, while respecting the 
marginal distribution condition. We call fi universal if it is n-universal for all n. 
For example, the standard normal law is universal: just decompose the (positive 
semi-definite) correlation matrix R into its Choleski form 5x5*. Then verify 
that if X is a column vector of n i.i.d. normal random variables, then SX has 
covariance matrix E(SXX t S t ) = 5 x 5* = R. If Y is any random variable not 
identically zero, and N is standard normal, then the normal scale mixture law 
of YN is universal — just check that YX has the same correlation matrix as X, 
where X is as above. It would be of general interest to characterize all universal 
distributions. 

The blossoming field of copulas (see, e.g., Nelsen, 2006) is largely concerned 
with similar issues, but until now it was mainly interested in n = 2 (see, e.g., 
Genest and MacKay (1986) for some early work), and, by convention, in the 
uniform marginal law fi. Marginal distributions without atoms can be mapped 
to a uniform law by the probability integral transform, and back with the inverse 
probability integral transform, hence the central role of the uniform law. These 
transforms generally alter the correlation matrix, but some transforms, such as 
between the normal and the uniform do not alter correlations a lot (see, e.g., 
Falk, 1999). This means that the universality problem has to be tackled for 
each marginal fi separately. 



For example, because of its universality, it is convenient to start with a ran- 
dom vector (Xi, . . . , X n ) with normal marginals as constructed above, achieving 
a certain correlation matrix R. Then ($(Ai), . . . , $(X„)) is a random vector 
with uniform [0, 1] marginals. Falk (1999) discusses the merits of this approach 
by noting that the maximal deviation between correlation coefficients before 
and after is at most 0.0181. One could attempt to start with a different correla- 
tion matrix R' for the normal random vector in the hope of obtaining the right 
correlation matrix R for (<E>(Ai), . . . , §>(X n )). However, for n > 3, this strategy 
is doomed to fail for some R, no matter how hard one tries in the construction 
of R': this will be shown in Section 4. 

Approximative solutions abound in the literature — some of these are sur- 
veyed in Devroye (1986). New approximations are being developed regularly, 
see, e.g., Headrick (2009). Discrete laws where the marginals are all Bernoulli 
have received particular attention (Emrich and Piedmonte (1991), Lee (1993)), 
but for any fixed marginal structure, the possible values of the 2™ "free joint 
probabilities" form a polytope, and thus, the region of allowable correlation 
matrix coefficients forms a polytope as well. As we will see below, for n > 3, 
the region of all allowable correlation matrix coefficients is convex but is not a 
polytope (see figure below for n = 3). In fact, any marginal law fi with a finite 
number of atoms is not n-universal for n > 3 for this reason. 

It is well-known that the uniform law and many other laws are 2-universal. 
The question begs whether the uniform law is universal. We do not have the 
answer to this, but the purpose of this note is to lift the veil carefully, and 
to reveal that the uniform law is 3-universal. In fact, we will show that all 
symmetric beta laws of parameter k > 1/2 are 3-universal: 



■'/. / CM = Y^x k - 1 (l-x) k - 1 l (0A) (xuI,, ,1) 



Arguing as we did above for the normal law, this implies that all symmetric 
unimodal densities are 3-universal — just observe that by Khinchine's theorem, 
each symmetric unimodal random variable can be written as YU, where U is 
uniform [—1, 1] and Y is arbitrary and independent of U. 
For p,9,rel consider the (3, 3) symmetric matrix 



R = 



1 r q 
rip 
q p 1 



(2) 



We denote by lZ n the set of semipositive definite matrices of order n with unit 
diagonal elements. By computing the principal minors one sees that R defined 
by ([2]) is in IZ3 if and only if the numbers 1 — p 2 , 1 — q 2 , 1 — r 2 and 

A = A(p, q, r) = det R = 1 - p 2 - q 2 - r 2 + 2pqr (3) 

are nonnegative. 

Recall that if X and Y are uniformly distributed on (0, 1) an abundant 
literature calls the joint distribution of (X, Y) a copula. For this reason, let us 
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call the distribution of (X±, . . . , X n ) an n-dimensional copula when X\, . . . , X n 
are uniformly distributed on (0, 1). 

2 Facts about lZ n and n-dimensional copulas 

Trivially the set lZ n is a closed convex subset of the linear space of symmetric 
matrices of order n. However the set of its extreme points is not easy to grasp 
and its characterization given by Ycart (1986) is difficult to handle for n > 4. It 
makes a sharp distinction with the cone V n of positive semi-definite symmetric 
matrices of order n, whose extremal lines are generated by matrices of rank 
one as an easy consequence of the spectral theorem for symmetric matrices. 
In the case of lZ n there are only 2" _1 matrices of rank one. These matrices 
are (ei€j)i<ij< n where ej = ±1 (they are extreme points of lZ n and they are 
the correlation matrices of X(ei, . . . , e n ) when X is a one-dimensional random 
variable with a second moment). Another notable point is the fact that positive 
definite correlation matrices are inner points of !Z n and cannot be extreme 
points: there are no extreme points of rank n. Finally, for our purposes, the 
most important result of Ycart (1986) is his characterization of the extreme 
points of lZ n which are matrices of rank 1 and 2. They are the matrices of the 
form 



where a\, . . . , a n are arbitrary numbers. 

The consideration of the extreme points of lZ n is justified by the following ob- 
servation: suppose that (Xt, . . . , X n ) ~ \i and (X[, . . . , X' n ) ~ //, suppose that 
the real random variables Xi , . . , , X n ,X[,.,., X' n are identically distributed and 
have second moment and denote by R(fi) and R(n') the respective correlation 
matrices of fi and //. Let A € [0, 1] and {X'{, X£) ~ A/x + (1 - A)//. Then 
obviously X'-' ~ X\ and the correlation matrix of the mixing satisfies 



Therefore in order to prove that for any R £ lZ n there exists a n-dimensional 
copula fx such that R = R{pi) enough is to prove it for all the cases where R 
is an extreme point of lZ n . For from the Caratheodory theorem, since lZ n is 
immersed in an affine space of dimension d — n(n — l)/2 then for any R 6 lZ n 
there exist d+ 1 extreme points Rq, . . . , Rd of lZ n and d+ 1 nonncgative numbers 
X , . . . X d of sum 1 such that R = J2j=o ^jRj- If we have been able to find for 
each extreme point Rj an n-dimcnsional copula \ij such that Rj = R{[ij) then 
from HI we get 



Needless to say these remarks extend to the case where the uniform distribution 
on (0, 1) is replaced by a distribution with finite second moments. 
Let us comment on the cases n = 2 and n = 3. 



(cos(a l - a J ))i<ij< n . 



R(\/i + (1 - A)//) = XR(fx) + (1 - \)R(jj.'). 



(4) 
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Figure 1: The space of the three off-diagonal correlation coefficients of a corre- 
lation matrix is a convex subset of [0, l] 3 . Strictly contained in it is the simplex 
formed by the four extremal points (1, 1, 1), (1,-1,-1), (—1,-1, 1), (— 1, 1, — 1). 
As a first step, we will give a construction for the three-dimensional [uniform] 
copula that achieves correlation equal to the point (—1/2,-1/2,-1/2) on the 
bulge of the surface. This surface is invariant by a 4-element group of rotations 
generated by the three symmetries with respect to the three axes. 

T 1 r 

The case n = 2. Clearly the elements of TZ 2 have the form R(r) = 
where — 1 < r < 1 and the two extreme points are i?(l) and R(— 1) since 

R(r) = ^^(l) + V^- 1 ) 

As explained above this leads to an immediate solution to the problem of finding 
a two-dimensional copula with correlation matrix R(r). 

If X is uniformly distributed, denote by /zi the distribution of (X, X), by 
the distribution of (X, — A), and by /i r the mixing fi r — -4r"Mi + ^r^-i 
leading to R(r) = i?(/i r ). Later in Section 4 we shall give another useful way 
of using Gaussian variables to design a two-dimensional copula with a given 
correlation matrix. 

The case n = 3. As seen before, the extreme points of H3 have rank 1 or 2 and 
they have the form (J2) with p = cos(«2 — 0^3), <Z = cos(a3 — ai), r — cos(ai — a^). 
For convenience we rather write a = a 2 — 013, b = — ax, c = ol\ — a2, with 

a + b + c = mod 2ir. 
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With this constraint wc denote 



R(a, b, c) 



1 cos c cos b 
cos c 1 cos a 
cos b cos a 1 



(5) 



For instance, we get the four matrices of rank one with the choices 

{a,b,c) = (0,0,0), (0,7r,7r), (tt.O.tt), (tt.tt.O). 

It is a good exercise to check A(cos a, cos 6, cos c) = with the notation ([3]). 
Note that R(a,b,c) has rank 2 if and only if (sin a, sin &, sine) ^ (0,0,0). 



3 Construction 



A GEOMETRIC CONSTRUCTION IN A PARTICULAR CASE. The aim of this section 

is to build a three-dimensional copula with the extremal correlation matrix 
R(a, b, c) defined by |(SJ| with a + b + c = mod 2ir. As explained in Section 2, 
this enables us to find a three-dimensional copula with an arbitrary correlation 
matrix R. We shall be able to do this even by replacing the uniform distribution 
Px.x by the beta distribution (3k. k defined by (Q} with k > 1/2. Since the 
solution is relatively complicated, it is desirable to work first for a special choice 
of parameters which has been for us a path toward the general case. This 
particular case is p — q = r = — 1/2 in the notation ([2]) or a = b = c = 2w/3 in 
the notation ([5|) . Since the correlation matrix is the covariance matrix of some 
affine transformations of the initial random variables, we rather consider three 
random variables (X, Y, Z) which are uniformly distributed on (— \/3, y3) — thus 
centered with variance 1. We want to construct the distribution of (X, Y, Z) such 
that 

E(XY) = E(YZ) = E(ZX) = -1/2, E(X 2 ) = E(Y 2 ) = E(Z 2 ) = 1. 

This implies that E((X + Y + Zf) = 0andX + y + Z = almost surely. The 
intersection of the plane X + Y + Z = and the cube {|X|, \ Y\, \ Z\ < y/3} is a 
hexagon. The largest disk D contained in this hexagon is centered at and has 
radius 3/V2 since it is tangent to the sides of the hexagon at six points 

±(\/3/2,V3/2,-V3), ±(V3/2,-V3, V3/2), ±(-V3, V3/2, V3/2). 

We now define an appropriate (and unique) distribution for (X, Y, Z) on D such 
that it is invariant by the rotations of the disk and such that the projections 
(X, Y,Z)^ X are uniform on (-%/3, %/3). 

An orthonormal basis of the plane X + Y + Z = is the pair of vectors 
fx = (0, l/y/2, -1/V2) and / 2 = (y/2/y/E,-l/y/E, -1/V&). Thus a distribution 
on D which is invariant by rotation is the distribution of 

4=i2(cos6/i +sin6/ 2 ) 
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where is uniform on [0, 2n) and is independent of the random variable R 6 
(0,1). This leads to 

X = V3i?sin6, Y = -V3Rsin(e- -), Z = -V3i?sin(9 + -). 

3 3 

Because O is uniform, X, Y and Z are identically distributed. Now we take R 
with distribution ^ 5 1q,i) (r)dr and we show that the distribution of i?| sin0| 
is uniform on (0,1) by computing its Mellin transform. For s > we have 

E(i? s )E(|sine| s ) = / drx- (smO^^^de (6) 

Jo VI - r 2 7T J 

1 , s 1, 1,1 s 1, 1 _ 
= -5(1 + -,- x -B - + -,-) = . □ 

2 V 2' 2 ; 7T V 2 2' 2 ; 1 + s 



Comments. We have just given an analytic proof of a theorem due to Archimedes, 
which says that if you project the uniform distribution on the three-dimensional 
sphere S2 onto a diameter, you get the uniform distribution on the diameter. 
Consider the three-dimensional sphere S constructed from the disk D above (We 
mean: having center zero and radius equal to the radius of D namely 3/\/2). 
Put the uniform distribution on S, project it orthogonally on D: this projection 
is actually the distribution of (X, Y, Z) above. 

The general case. Without loss of generality we assume that R(a, b, c) de- 
fined by ([5]) has rank two, that is, (sin a, sin b, sin c) ^ (0, 0, 0) . In the sequel we 
assume without loss of generality that sine 7^ 0. 

Let us observe that i?(a, 6, c) is the correlation matrix of the centered random 
variable (X, Y, Z) if and only if 

Xsina + Fsinfr + Zsinc^O. (7) 

To see this, observe that 

E((X sin a + Y sin b + Z sin c) 2 ) 

= sin 2 a + sin 2 b + sin 2 c + 2 (cos c sin a sin b + cos b sin a sin c + cos a sin b sin c) 
= sin 2 (b + c) + sin 2 b + sin 2 c 

+2(— coscsin(& + c) sin& — cos b sin b + c) sine + cos(6 + c) sin & sine) 

= 0. 

We imitate the previous particular case as follows: we watch the intersection 
of the plane P defined by ([7]) with the cube (—1, l) 3 (which is slightly more 
convenient for the general case than the cube(— V3) 3 that we have used 
before). This intersection is a hexagon — and not a lozenge — for the following 
reason: 
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Figure 2: Illustration of our construction. First take a point uniformly on 
the surface of the ball. Project it to the plane shown (so that it falls in the 
circle). The three coordinates of that point are each uniformly distributed on 
[0, 1]. Also, the correlation structure is given by E(XY) = E(YZ) = E(ZX) = 
-1/2, E(A 2 ) = E(F 2 ) = E(Z 2 ) = 1. 

Lemma 3.1. The plane aX + j3Y + = defines a plane P such that the 
intersection with (—1, l) 3 is a hexagon if and only if there exists a triangle with 
sides |a|, |/3|, | -y | , namely 



Proof. =X Assume that \a\ > \/3\ + \ Then the intersections of P with the 
four lines Y = e = ±1, Z = r) = ±1 are given by the four points 



which are the vertices of a lozenge. Since —1 < — ~ ^ ^ ^lifs lozenge in 
contained in the cube. The converse is similar. □ 

With a + b + c = mod 2ir we now apply the lemma to a = sin a, j3 — 
sin 6,7 = sin c. Observe that 

| sin a | = | sin(6 + c)\ = \ sin b cos c + sin c cos b\ < \ sin b\ + | sin c| 

and the two other inequalities are similar: therefore the intersection of P with 
the cube is a hexagon H. 



\a\ < + | 7 |, ^|<H + | 7 |, | 7 |<M + |0|. 
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We now construct the unique ellipse E inscribed in H . For this we introduce 
for z € (—1, 1) the ellipse E z in the (x,y) plane defined by 



E z ={{x,y) ;A(x,y,z) = 0} 

where A is defined by (|3J). This ellipse is inscribed in the square [— 1, l] 2 and 
the four points of tangency with the square are ±(z, 1) and ±(1, z). We denote 
by U z the closed convex hull of E z . 

Lemma 3.2. The projection of the ellipse E by (x, y, z) i-> (x, y) is E cosc 

Proof. Denote by E 1 the intersection of the plane P with the cylinder 

{(x, y, z) ; A(x, y, cos c) = 0}. 

We want to prove that E' is inscribed in H and thus that E — E' . Since 
the four points of tangency of E cosc with the square [— 1,1] 2 are ±(cosc, 1) 
and ±(l,cosc), observe that the unique points of the four lines ±(cosc, l,z) 
and ±(l,cosc, z) which are located on P are exactly A± — ±(1, cose, cos 6), 
and B± = ±(cosc, 1, cos a). By construction these four points are points of 
tangency of E' with H. Now observe that the two points C± = ±(cos b, cos a, 1) 
belong to E'; This comes from the fact that A(cos b, cos a, cos c) = (since the 
matrix R(a, b, c) of JU is singular) and thus C± are on the cylinder; the second 
fact is that sin a cos b + sin b cos a + sin c = and this implies that C± are on 
P. Trivially also C± are in H. For showing that C± are points of tangency 
of E' to H we have to prove that (x,y,z) € E' implies that \z\ < 1. Here 
is a not too elegant proof: we have to deduce \z\ < 1 from the two equalities 
A(x, y, cos c) = and x sin a + y sin b + z sin c = or 

22 . 2 sin a sin b 

x +y — 2xy cos c = sin c, — z = — : — x -\ — : — y. 

sin c sin c 

We make the change of variable x = -^(X + Y), y — ^(X ~ Y), which leads 
to 

X 2 Y 2 

1 = 1. 

1 — cos c 1 + COS c 



We therefore write X = VI — cose cos 9, Y = \A + cose sin 9 and we get 

c c 

x = sin(9 + -), i/ = sin(0 — -), z = — cost cos# — sinr sin# = — cos(0 — t), 
where 

/ sin a sin6\ c / sin a sin6\ c 

cos r = — h — — cos - , sm r = — : — sin - (8) 

\ sm c sm c / 2 \ sm c sm c / 2 

(we skip the proof that the squares of these two quantities add to one and that r 
does exist). Finally \z\ = \ cos(t — 6)\ < 1. To conclude the proof of the lemma, 
we observe that E' has six distinct points with H, A±, B±, C±. This implies 
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that E' is the unique ellipse inscribed in H and therefore E = E' . This ends 
the proof of the lemma. □ 

Because of the symmetry between (a, 6, c) and the previous lemma the two 
other projections of E by (x, y, z) i-» (x,z) and (x, y, z) (y,z) have respec- 
tively the equations A(x, cos b, z) = Q and A(cos a,y,z) — 0. 

We now construct a probability pk on the convex hull U of E such the three 
margins of pk are the probabilities 

vuidx) = 2l ~^ k \ l ~ x*) k -\_ hl) {x)dx 

where k > 1/2. Note that is nothing but the image of j3k by the map 
x i— > 2a; — 1. For this construction of ^ we use the following parameterization 
of U by r € (0, 1] and 9 € [0, 27r) inherited from the calculations of the previous 
lemma, where r is defined by © 

x = r sm(9 + — ) 

V = rsin(0-|) (9) 
z = — r cos(# — t). 

On the space (0, 1] x [0,2w) we define the probability for k > 1/2 
p fe (dr,d0) = (2fe-l)(l-r 2 ) fe_| rdr x — 

as well has 

p 1/2 (dr,d6) = Sx(dr) x — 

and on the space U we define the probability as the image of pk by the 
map (r,6) (x,y,z) defined by the equalities If (X, Y, Z) ~ fik we claim 
now that X ~ Y ~ Z ~ i/^. The proof by Mellin transform is quite analogous 
to For fc > 1/2 if 9 is uniform on [0, 27r) and is independent of R ~ 

(2fc - 1)(1 - r 2 ) fe_ lrl(o.i)(r)(ir then (R, 0) — p fc . If R = 1 the same is true for 
fc = 1/2. 

We show that the distribution of _R|sin8| is the distribution of \T\ when 
T r*> Vk by computing its Mellin transform. For s > we have 

<2k _ i r 1 , o t^l 2 .4-1 

E(i? s )E(|sin8| s ) = (l-r 2 ) k -?r s+1 drx - (sin9) 2 ^- 1 d9 

2 Jo * Jo 

1, „, s, 1 , 1 „ . 1 si. 

= ^-2^ 1+ 2' fc ~2) X ^2 + 2'2) 

i r(fc + i)r(^±i) 2- 8 *- 1 a + i 



T(k+^) B(k,k) 

2 -2fe 

5(fc,fc) 



/ (l-t 2 ) k - 1 t s dt = E(\T\ s ). 
Jo 
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Since R sin 9 and T are symmetric random variables we get R sin 6 ~ T. Finally 



X = Rcos(B - |) ~ Y = i?cos(9 - |) ~ Z = -i?cos(6 - r) ~ flsin6 ~ T. 
The Mellin transforms of \T\ and R have shown that 

E(X 2 ) = E(F 2 ) = E(Z 2 ) = E(i? 2 ) = ^ rT . 

Therefore, 

E(Xy) = ^lE(2 S in(e-|)sin ( e + |)) = ^. 

This shows that the correlation between X and Y is cos c. to get the correlation 
between X and Z we use Z = — sinax — SiS^y and the computation already 

sm c sin c r- ^ 

done of E(J7) and of E(X 2 ). We get easily that the correlation of (X,Z) is 
cos &. Similarly the correlation between Y and Z is cos a, and this achieves the 
proof of R(fik) = R(a,b,c). 

Comments about the two-dimensional marginals. In the general case 
we do no longer have for \xh the generalization of the beautiful interpretation 
of Hi in terms of the Archimedes theorem. However if [X, Y, Z) ~ /j^ the joint 
distributions of (X, Y), (Y, Z) and (Z, X) already appear in the literature. The 
distribution tfik, C osc{dx,dy) of (X, Y) is concentrated on the convex hull U cosc 
of the ellipse E cosc when k > 1/2 and is concentrated on the ellipse E cosc for 
k = 1/2. For k > 1/2 we have since A(x,y,cosc) = (1 — r 2 )sin 2 c 

4>k,cosc{dx,dy) = ^-^-\smc\^ k A(x,y,cosc) k ^H Ucoar (x,y)dxdy. 

This distribution 4>k, CO sc appears as a Lancaster distribution for the pair (i>k, ^fe) 
More specifically consider the sequence (Q n )^° = o 01 the orthonormal polynomials 
for the weight Vf,- Thus Q n is the Jacobi polynomial P*~ 1 ' fc ~ 1 normalized such 
that J_ 1 Q 2 n (x)v k (dx) = 1. For 1/2 < k denote 



K(x,y,z) = 



n=0 



Qn(x)Qn(y)Qn(z) 
Qn{l) 



This series converges if \x\, \y\, \z\ < 1 and its sum is zero when (x, y) is not in 
the interior U z of the ellipse E z . With this notation we have 

<t>k, cos c(dx,dy) = K(x, y, cos, c)v k {dx)v k {dy). 

This result is essentially due to Gasper (1971). See Koudou (1995) and (1996) 
and Letac (2009) for details. Needless to say, the distributions of (Y, Z) and 
(Z,X) are 4>k, COS a and 4> k , cosb . 
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4 The use of Gaussian variables for building a 
n-dimensional copula with given correlation. 

We start from the simplest idea: if 



$(x) = 



2 du 



and if X ~ iV(0, 1) we have <I?(Ar) uniform on (0, 1). Let us denote 
T(x) = T(sg) = 2V3($(x) - 1/2). 



(10) 



Then T(X) is uniform on {-y/3, VS) with mean and variance 1. Assume now 
that (X%, . . . ,X n ) is Gaussian N(0,R) with covariance R g 72„. Denote by 
.R* the covariance matrix of (T(Xi), . . . ,T(X n )) and observe that R* is also 
the correlation matrix of (T(Xi), . . . ,T{X n )) and of the n-dimensional copula 
($(Xl), . . . , $(X„)). [Copulas constructed in this manner are sometimes called 
Gaussian copulas.] In this section we compute R* as a function of R and we 
examine the image 72* of 72„ into itself of the function R H> R* . As we are 
going to see 72* is strictly smaller than 72„ for n > 3. To compute R* , we need 
the following result. 

Proposition 4.1. Let (X,Y) be a centered Gaussian variable of K 2 with co- 
1 



variance matrix 



r 

r 1 



Then 



E($(AT)$(y)) 
E(T(X)T(Y)) 



1 

2 

3 



1 r 

— argcos-, 

6 r 

— arg cos — . 



(11) 
(12) 



The proof of (fTTj) can be done by brute force and the computation of a four- 
dimensional integral. We rather going to obtain Proposition 4.1 in a more 
interesting way after the following result. 

Theorem 4.2. Let (X, Y) be a centered Gaussian variable of K 2 with covariance 
1 r 1 

and let / be a real measurable function such that K r (f(X)) = 



matrix 



1 



and ¥, r (f(X) 2 ) — 1. Consider the Hermite polynomials (7?n)^Lo defined by the 
generating function 



n=0 



-ffn(x)- 



and the expansion in orthogonal functions 



/(«) = a ' 



#n(a;) 



n=l 
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Then for all — 1 < r < 1 we have Yl™=i a n — 1 an< ^ 

oo 

E(f(X)f(Y)) = J2<r n . (13) 

n=l 



Proof. Let us compute 

E (e-"V e --V) L S -E(H n (X)H m (Y)). 

n—0 m— 

For this, write r = cos a with < a < it. If X, Z are independent centered 
real Gaussian random variables with variance 1, then Y = X cosa + Zsma is 
centered with variance 1, (X, Y) is Gaussian and E(XY) = cosa. Therefore a 
simple calculation gives 

E(e xt -^e Ys -^ = e tscosa . 

This shows that E(H n (X)H m (Y)) = if n ^ m and that E(if„(X)tf n (F)) = 
n! cos™ a. From this we get the result. □ 



Corollary 4.3. Let p n > such that Y^=iPn — 1 an d consider the generating 
function g(r) = Y^=iPnr n ■ Let R = {rij)i< itj < d be in TZ n . Then R* = 
(g( r ij))i<ij<d is the covariance and correlation matrix of the random variable 
(f(Xi), . . . , f(Xd)) where (Xi, . . . , Xd) is centered Gaussian with covariance R 
and where 

t( \ ST ^H n (x) 
f( x ) = 1^ e "V^^=T 

with e„ = ±1. 

Comment. Note that many functions / can give the same covariance R* for 
(f(Xi), . . . , f(Xd)), by taking arbitrary signs in the sequence (e n )„>i above. It 
is worthwhile mentioning that Theorem 4.2 is easily extended to p variables in 
the following sense. Consider 



ni,.. . ,n p 



^ ani -- np y V !...„„! 



and assume that an n = and that V , „ af! „ = 1. Define 
J(r (1) rW)= £ <,...,„> (1) r---(r- (p) r>. 



ni,...,n p 



Let X( fe ) = (Xj fe) ,...,^ fc) ) - N(0,RW) for fc = l,...,p be independent 
Gaussian variables of MP such that the covariance R^ = (^?)i<i,j<n is a 
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correlation matrix. Define Yj — /(xj , . . . , X. ). Then the covariance matrix 
of (Yi, . . . , Y n ) is the correlation matrix 

( 5 (rW,...,rJ ) )) 1 < lJ < n . 



Proof of Proposition 4.1. We apply Theorem 4.2 to the function f = T 
dchncd by (ITU1) . For this we have to compute 



Note that this is zero for even n since H n and T respectively even and odd 
functions. Thus we have to compute P2 n +i > and £2n+i — ±1 such that 

2n+ V(2« + l)! V (2n + l)! 
To this purpose we watch the coefficient of t n in the power expansion of 

E ( T(X)e xt " 



For this we need 

1 i ~ (_i)» t 2n+1 



E ( $( x, e -4)= $ (J,) = i + ^E^ 

/~ OO 



J2J 2 2^/Tr ^ 4 n n! 2n + 1 ' 

V 7 V 71—0 

and _ 

Therefore , 



7r ^ 4™n! 2n + 1 

n=0 



V^TT _ /3(-l) n 1 



^(2/1+1)! V 7T 4»n! 2n + 1 

which shows that £2n+i — {— 1)™- To finish the proof we apply (|13j) to et2„+i = 
^P2n+i and we get 

Q 00 1 1 r 2n+l c 

E(T(X)T(Y) = - -) n— -;f— J = 3 - - argcos-, 
7r 2 4 n n! 2n+ 1 n 2 

n=0 

the last equality being easily checked. Of course (fTTj) is deduced from (IT21 .D 
We now prove that 71% is strictly smaller than 72.3. For this we observe that 

* o 6 r nr* 

r =3 arccos - r — 2 sin . 

7T 2 6 
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Consider the matrix 



R p — 



1 V V 
P 1 P 
P P 1 



Since dcti? p = (1— p) 2 (l + 2p), the matrix R p is in TZ'^ if and only if —1/2 < p < 
1. However 2sinY^ < — 1/2 therefore -fi^sin^ cannot be a correlation matrix. 
This shows that there is no Gaussian variable (X, Y, Z) such that the correlation 
matrix of ($(A), $(T), $(Z)) is R-i/2- To see that TZ* n is strictly smaller than 
lZ n for n > 4, observe that that the block matrix diag(i?_!/ 2 , In-3) is in lZ n 
and not in 7Z1 . 
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